49 research outputs found
Scalable Recommendation with Poisson Factorization
We develop a Bayesian Poisson matrix factorization model for forming
recommendations from sparse user behavior data. These data are large user/item
matrices where each user has provided feedback on only a small subset of items,
either explicitly (e.g., through star ratings) or implicitly (e.g., through
views or purchases). In contrast to traditional matrix factorization
approaches, Poisson factorization implicitly models each user's limited
attention to consume items. Moreover, because of the mathematical form of the
Poisson likelihood, the model needs only to explicitly consider the observed
entries in the matrix, leading to both scalable computation and good predictive
performance. We develop a variational inference algorithm for approximate
posterior inference that scales up to massive data sets. This is an efficient
algorithm that iterates over the observed entries and adjusts an approximate
posterior over the user/item representations. We apply our method to large
real-world user data containing users rating movies, users listening to songs,
and users reading scientific papers. In all these settings, Bayesian Poisson
factorization outperforms state-of-the-art matrix factorization methods
Allosteric collaboration between elongation factor G and the ribosomal L1 stalk directs tRNA movements during translation
Determining the mechanism by which transfer RNAs (tRNAs) rapidly and
precisely transit through the ribosomal A, P and E sites during translation
remains a major goal in the study of protein synthesis. Here, we report the
real-time dynamics of the L1 stalk, a structural element of the large ribosomal
subunit that is implicated in directing tRNA movements during translation.
Within pre-translocation ribosomal complexes, the L1 stalk exists in a dynamic
equilibrium between open and closed conformations. Binding of elongation factor
G (EF-G) shifts this equilibrium towards the closed conformation through one of
at least two distinct kinetic mechanisms, where the identity of the P-site tRNA
dictates the kinetic route that is taken. Within post-translocation complexes,
L1 stalk dynamics are dependent on the presence and identity of the E-site
tRNA. Collectively, our data demonstrate that EF-G and the L1 stalk
allosterically collaborate to direct tRNA translocation from the P to the E
sites, and suggest a model for the release of E-site tRNA
Comparing Traditional and LLM-based Search for Consumer Choice: A Randomized Experiment
Recent advances in the development of large language models are rapidly
changing how online applications function. LLM-based search tools, for
instance, offer a natural language interface that can accommodate complex
queries and provide detailed, direct responses. At the same time, there have
been concerns about the veracity of the information provided by LLM-based tools
due to potential mistakes or fabrications that can arise in algorithmically
generated text. In a set of online experiments we investigate how LLM-based
search changes people's behavior relative to traditional search, and what can
be done to mitigate overreliance on LLM-based output. Participants in our
experiments were asked to solve a series of decision tasks that involved
researching and comparing different products, and were randomly assigned to do
so with either an LLM-based search tool or a traditional search engine. In our
first experiment, we find that participants using the LLM-based tool were able
to complete their tasks more quickly, using fewer but more complex queries than
those who used traditional search. Moreover, these participants reported a more
satisfying experience with the LLM-based search tool. When the information
presented by the LLM was reliable, participants using the tool made decisions
with a comparable level of accuracy to those using traditional search, however
we observed overreliance on incorrect information when the LLM erred. Our
second experiment further investigated this issue by randomly assigning some
users to see a simple color-coded highlighting scheme to alert them to
potentially incorrect or misleading information in the LLM responses. Overall
we find that this confidence-based highlighting substantially increases the
rate at which users spot incorrect information, improving the accuracy of their
overall decisions while leaving most other measures unaffected
Recommended from our members
Graphical models for inferring single molecule dynamics
The recent explosion of experimental techniques in single molecule biophysics has generated a variety of novel time series data requiring equally novel computational tools for analysis and inference. This article describes in general terms how graphical modeling may be used to learn from biophysical time series data using the variational Bayesian expectation maximization algorithm (VBEM). The discussion is illustrated by the example of single-molecule fluorescence resonance energy transfer (smFRET) versus time data, where the smFRET time series is modeled as a hidden Markov model (HMM) with Gaussian observables. A detailed description of smFRET is provided as well.
The VBEM algorithm returns the model’s evidence and an approximating posterior parameter distribution given the data. The former provides a metric for model selection via maximum evidence (ME), and the latter a description of the model’s parameters learned from the data. ME/VBEM provide several advantages over the more commonly used approach of maximum likelihood (ML) optimized by the expectation maximization (EM) algorithm, the most important being a natural form of model selection and a well-posed (non-divergent) optimization problem.
The results demonstrate the utility of graphical modeling for inference of dynamic processes in single molecule biophysics
Quantification of Cell Movement Reveals Distinct Edge Motility Types During Cell Spreading
Actin-based motility is central to cellular processes such as migration, bacterial engulfment, and cancer metastasis, and requires precise spatial and temporal regulation of the cytoskeleton. We studied one such process, fibroblast spreading, which involves three temporal phases: early, middle, and late spreading, distinguished by differences in cell area growth. In these studies, aided by improved algorithms for analyzing edge movement, we observed that each phase was dominated by a single, kinematically and biochemically distinct cytoskeletal organization, or motility type. Specifically, early spreading was dominated by periodic blebbing; continuous protrusion occurred predominantly during middle spreading; and periodic contractions were prevalent in late spreading. Further characterization revealed that each motility type exhibited a distinct distribution of the actin-related protein VASP, while inhibition of actin polymerization by cytochalasin D treatment revealed different dependences on barbed-end polymerization. Through this detailed characterization and graded perturbation of the system, we observed that although each temporal phase of spreading was dominated by a single motility type, in general cells exhibited a variety of motility types in neighboring spatial domains of the plasma membrane edge. These observations support a model in which global signals bias local cytoskeletal biochemistry in favor of a particular motility type
Recommended from our members
Lateral Membrane Waves Constitute a Universal Dynamic Pattern of Motile Cells
We have monitored active movements of the cell circumference on specifically coated substrates for a variety of cells including mouse embryonic fibroblasts and T cells, as well as wing disk cells from fruit flies. Despite having different functions and being from multiple phyla, these cell types share a common spatiotemporal pattern in their normal membrane velocity; we show that protrusion and retraction events are organized in lateral waves along the cell membrane. These wave patterns indicate both spatial and temporal long-range periodic correlations of the actomyosin gel